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CsJ Abstract 

'-^ We consider the convergence time for solving the binary consensus problem using 

the interval consensus algorithm proposed by Benezit, Thiran and Vetterli (2009). In 



the binary consensus problem, each node initially holds one of two states and the goal 
for each node is to correctly decide which one of these two states was initially held by 
a majority of nodes. 

We derive an upper bound on the expected convergence time that holds for arbi- 
trary connected graphs, which is based on the location of eigenvalues of some contact 
rate matrices. We instantiate our bound for particular networks of interest, including 
complete graphs, paths, cycles, star-shaped networks, and Erdos-Renyi random graphs; 
for these graphs, we compare our bound with alternative computations. We find that 
for all these examples our bound is tight, yielding the exact order with respect to the 
number of nodes. 

^C) We pinpoint the fact that the expected convergence time critically depends on the 

voting margin defined as the difference between the fraction of nodes that initially held 
the majority and the minority states, respectively. The characterization of the expected 
convergence time yields exact relation between the expected convergence time and the 
voting margin, for some of these graphs, which reveals how the expected convergence 

CN time goes to infinity as the voting margin approaches zero. 

Our results provide insights into how the expected convergence time depends on the 
network topology which can be used for performance evaluation and network design. 

^ The results are of interest in the context of networked systems, in particular, peer-to- 

peer networks, sensor networks and distributed databases. 



1 Introduction 

Algorithms for distributed computation in networks have recently attracted considerable 
interest because of their wide-range of applications in networked systems such as peer-to- 
peer networks, sensor networks, distributed databases, and on-line social networks. A specific 
algorithmic problem of interest is the so called binary consensus [21 [3J, HI |5] where, initially, 
each node in the network holds one of two states and the goal for each node is to correctly 

*A preliminary version without proofs of this work first appeared in pQ 
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decide which one of the two states was initially held by a majority of nodes. This is to 
be achieved by a decentralized algorithm where each node maintains its state based on the 
information exchanged at contacts with other nodes, where the contacts are restricted by 
the network topology. It is desired to reach a final decision by all nodes that is correct and 
within small convergence time. 

A typical application scenario of the binary consensus corresponds to a set of agents who 
want to reach consensus on whether a given event has occurred based on their individual, 
one-off collected, information. Such cooperative decision-making settings arise in a number 
of applications such as environmental monitoring, surveillance and security, and target track- 
ing [BJ, as well as voting in distributed systems [7]. Furthermore, it has been noted that one 
can use multiple binary consensus instances to solve multivalued consensuses; we refer to 
[H [9j for an account on such algorithms. 

We consider a decentralized algorithm known as interval or quantized consensus proposed 
by Benezit, Thiran, and Vetterli |4]. The aim of this algorithm is to decide which one of k > 2 
partitions of an interval contains the average of the initial values held by individual nodes. In 
this paper, we focus on binary interval consensus, i.e. the case k = 2. An attractive feature of 
the interval consensus is its accuracy; it was shown in jl] that for any finite connected graph 
that describes the network topology, the interval consensus is guaranteed to converge to the 
correct state with probability 1. However, the following important question remained open: 
How fast does the interval consensus converge to the final state? We answer this question 
for the case of binary interval consensus. 

The interval consensus could be considered a state-of-the-art algorithm for solving the 
binary consensus problem as it guarantees convergence to the correct consensus (i.e. has 
zero probability of error) for arbitrary finite connected graphs. Besides, it only requires a 
limited amount of memory and communication by individual nodes (only four states). Some 
alternative decentralized algorithms require fewer states of memory or communication but fail 
to reach the correct consensus with strictly positive probability. For instance, the traditional 
voter model requires only two states of memory and communication. It is however known 
that there are graphs for which the probability of error is a strictly positive constant, e.g. 
proportional to the number of nodes that initially held the minority state in the case of 
complete graphs (see [TU] for the general setting). Another example is the ternary protocol 
proposed in [3] for which it was shown that for complete graphs, the probability of error 
diminishes to zero exponentially with the number of nodes, but provides no improvement 
over the voter model for some other graphs (e.g. a path). 

In this paper, we provide an upper bound on the expected convergence time for solving the 
binary interval consensus on arbitrary connected graphs. This provides a unified approach 
for estimating the expected convergence time for particular graphs. The bound is tight in the 
sense that there exists a graph, namely the complete graph, for which the bound is achieved 
asymptotically for large number of nodes. 

We demonstrate how the general upper bound can be instantiated for a range of particular 
graphs, including complete graphs, paths, cycles, star-shaped networks and Erdos-Renyi 
random graphs. Notice that the complete graph and the Erdos-Renyi random graph are 
good approximations of various unstructured and structured peer-to-peer networks and that 
star-shaped networks capture the scenarios where some node is a hub for other nodes. 

Our results provide insights into how the expected convergence time depends on the net- 
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work structure and the voting margin, where the latter is defined as the difference between 
the fraction of nodes initially holding the majority state and the fraction of nodes initially 
holding the minority state. For the network structure, we found that the expected conver- 
gence time is determined by the spectral properties of some matrices that dictate the contact 
rates between nodes. For the voting margin, we found that there exist graphs for which the 
voting margin significantly affects the expected convergence time. 

Complete graph example For concreteness, we describe how the voting margin affects 
the expected convergence time for the complete graph of n nodes. Let us denote with a > 1/2 
the fraction of nodes that initially held the majority state, and thus a — (1 — a) = 2a — 1 is 
the voting margin. We found that that the convergence time T satisfies 

E(T) = ^ I logH(l + o(l)). 

Therefore, the expected convergence time is inversely proportional to the voting margin, and 
thus, goes to infinity as the voting margin goes to 0. Hence, albeit the interval consensus 
guarantees convergence to the correct state, the expected convergence time can assume large 
values for small voting margins. 

Outline of the Paper In Section [2] we discuss the related work. Section [3] introduces the 
notation and the binary interval consensus algorithm considered in this paper. Section [4] 
presents our main result that consists of an upper bound on the expected convergence time 
that applies to arbitrary connected graphs (Theorem [TJ. Section [5] instantiates the upper 
bound for particular graphs, namely complete graphs, paths, cycles, star-shaped networks 
and Erdos-Renyi random graphs, and compares with alternative analysis. We conclude in 
Section [6j Some of the proofs are deferred to the appendix. 

2 Related Work 

In recent years there has been a large body of research on algorithms for decentralized 
computations over networks, under various constraints on the memory of individual nodes 
and communication between the nodes. For example, in the so called quantized consensus 
problem [21 [EE] , the goal is to approximately compute the mean of the values that reside at 
individual nodes, in a decentralized fashion, where nodes communicate quantized information. 
In [5], the authors provided bounds on the convergence time in the context of averaging 
algorithms where nodes exchange quantized information. 

The work that is most closely related to ours is [3] where the authors showed that the 
so called interval consensus algorithm guarantees correctness for arbitrary finite connected 
graphs. In particular, their work shows that for solving the binary consensus problem, it 
suffices to use only two extra states to guarantee convergence to the correct consensus in 
a finite time, for every finite connected graph. Our work advances this line of work by 
establishing the first tight characterizations of the expected convergence time for the binary 
interval consensus. 
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Previous work on the binary consensus problem considered algorithms under more strin- 
gent assumptions on the number of states stored and communicated by individual nodes. 
The standard voter model is an algorithm where each node stores and communicates one 
of two states (0 or 1), where each instigator node switches to the state observed from the 
contacted node. The voter model has been studied in the context of various graph topolo- 
gies [T21 H3l IT4] and the probability of reaching the correct consensus (i.e. corresponding to 
the initial majority state) is known in closed-form for arbitrary connected graphs [10J. Specif- 
ically, the probability of reaching the correct consensus is proportional to the sum of degrees 
of the nodes that initially held the initial majority state. In particular, for the complete 
graphs, this means that the probability of reaching an incorrect consensus is proportional 
to the number of nodes that initially held the minority state. Moreover, for some network 
topologies, the convergence time of the voter model is known to be quadratic in the number 
of nodes, e.g. for a path [15]. In fact, It was shown in [16] that it is impossible to solve the 
binary consensus problem without adding extra memory to encode the states of the nodes. 
Both [4j and [T7] show that adding one additional bit of memory is sufficient. 

In [3] , the authors considered a ternary protocol for binary consensus problem where each 
node stores and communicates an extra state. It was shown that for the complete graph 
interactions, the probability of reaching the incorrect consensus is exponentially decreasing 
to as the number of nodes n grows large, with a rate that depends on the voting margin. 
Moreover, if the algorithm converges to the right consensus, then the time it takes to complete 
is logarithmic in the number of nodes n, and is independent of the voting margin. Similar 
results have been derived in [18] for the complete graph where, instead of using an extra 
state, each nodes polls more than one neighbour at a time and then chooses the opinion held 
by the majority of the polled neighbours. Notice that this is unlike to the binary interval 
consensus, for which we found that the expected convergence time, for the complete graph 
of n nodes, is logarithmic in n, but with a factor that is dependent on the voting margin and 
going to infinity as the voting margin approaches zero. The main advantage of the binary 
interval consensus algorithm over both these protocols is the guaranteed convergence to the 
correct final state with probability 1, albeit this seems to be at some expense with respect 
to the convergence time for some graphs. 

Finally, we would like to mention that in a bigger picture, our work relates to the cas- 
cading phenomena that arise in the context of social networks [19]; for example, in the viral 
marketing where an initial idea or behaviour held by a portion of the population, spreads 
through the network, yielding a wide adoption across the whole population [20J. 




3 Algorithm and Notation 

In this section, we introduce the interval consensus algorithm for the binary consensus prob- 
lem. Each node is assumed to be in one of the following four states 0, eo, e\ and 1, 
at every time instant. It is assumed that the states satisfy the following order relations 
< e < e% < 1. Let us first describe a simple example to illustrate the updating protocol. 

Example Assume that we have four nodes labelled 1,2,3,4, forming a line network 1 — 
2 — 3 — 4, starting in state (1,0,0,0). If the first interaction occurs between node 1 and 




4 



node 2 then, as they disagree, both of them become undecided. More precisely, the state 
of node 1 turns into eo indicating that she saw opinion and the state of node 2 turns 
into t\ indicating that she saw opinion 1. The new vector of states becomes (eo,ei,0,0). If 
the following interaction is between nodes 3 and 4 then nothing happens. If nodes 1 and 2 
interact again then their states are swapped, i.e. the vector of states becomes (ei,e ,0,0). 
Now suppose that nodes 2 and 3 interact then they swap their states, i.e. the vector of states 
becomes (ei, 0, eo, 0). This transition indicates that node 2 was undecided and saw opinion 
so she adopts whereas node 3 saw an undecided node so she becomes undecided. If nodes 
1 and 2 interact then the vector of states becomes (eo,0,eo,0). This indicates that node 1 
saw an undecided node so becomes undecided whereas node 2 was undecided so it adopted 
the opinion of node 1. After this stage, the dynamics does not settle as if a node in eo and 
a node in interact then they swap their opinions but the number of nodes in each of the 
states eo and stays constant which is indicative of the fact that the initial majority held 
the opinion 0. 

We now describe the set of rules for updating the states of the nodes. 

State update rules The states held by the nodes are updated at pairwise contacts between 
nodes according to the following state update rules: 

1. If a node in state and a node in state 1 get in contact, they switch their states to 
state e\ and state e , respectively. 

2. If a node in state eo and a node in state 1 get in contact, they switch their states to 
state 1 and state e±, respectively. 

3. If a node in state e\ and a node in state get in contact, they switch their states to 
state and state e , respectively. 

4. If a node in state eo and a node in state get in contact, they swap their states to state 

and eo, respectively. 

5. If a node in state t\ and a node in state 1 get in contact, they swap their states to state 

1 and state ei, respectively. 

6. If a node in state eo and a node in state e\ get in contact, they swap their states to 
state ei and state eo, respectively. 

For any other states of a pair of nodes that get in contact, their states remain unchanged. 

Temporal process of pairwise interactions We admit the standard asynchronous com- 
munication model P, [21] where any pair of nodes interacts at instances of a Poisson 
process with rate qij > 0. We denote with V — {1, 2, . . . , n} the set of nodes. The interaction 
rates are specified by the matrix Q = (qi,j)i,jev assumed to be symmetrical The transition 
matrix Q induces an undirected graph G = {V,E) where there is an edge {i,j) G E if and 
only if q it j > 0. We assume that graph G is connected. 

1 We assume that i contacts j at a rate ptj > so that i and j interact at a rate = pij + Pji- Therefore, 
defining P = (pij)ij£v, we have Q = P + P T is symmetric matrix, i.e. q^j = qjj for every i, j G V and 
qi.i — for every i g V . 
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Two convergence phases The state update rules ensure that in a finite time, a final state 
is reached in which all nodes are either in state or state eo (state is initial majority). We 
distinguish two phases in the convergence to the final state, which will be a key step for our 
analysis of the expected convergence time that relies on separately analyzing the two phases. 
The two convergence phases are defined as follows: 

Phase 1 (depletion of state 1). This phase begins at the start of the execution of the algorithm 
and lasts until none of the nodes is in state 1. Whenever a node in state and a node 
in state 1 get in contact, they switch to states e\ and eo, respectively. It is therefore 
clear that the number of nodes holding the minority state (state 1) decreases to in 
a finite amount of time and from that time onwards, the number of nodes in state 
remains equal to the difference of the (initial) number of nodes in state and state 1. 

Phase 2 (depletion of state e\). This phase follows the end of phase 1 and lasts until none of the 
nodes is in state e±. In this phase, the number of nodes in state ei decreases following 
each contact between a node in state e± and a node in state 0. Since, in this phase, no 
interaction between a pair of nodes results in increasing the number of nodes in state 
ei, there are eventually no nodes in state e\. 

The duration of each of the two phases is ensured to be finite for arbitrary finite connected 
graphs by the definition of the state update rules where swapping of the states enables that 
state 1 nodes get in contact with state nodes and similarly, enables that state e\ nodes get 
in contact with state nodes. 

Additional notation We denote by Sj(t) the set of nodes in state i G {0, e ,ei,l} at 
time t. With a slight abuse of notation, in some cases we will use the compact notation 
| | = | (0)|, i — 0,1, which should be clear from the context. We define a G (1/2, 1] as the 
fraction of nodes that initially hold state 0, assumed to be the initial majority. Therefore, 
| So | — an and |Si| = (1 — a)n. 

4 General Bound for the Expected Convergence Time 

In this section we present our main result that consists of an upper bound on the expected 
convergence time for arbitrary connected graphs. 

The bound is in terms of eigenvalues of a set of matrices Qs that is defined using the transition 
matrix Q as follows. Let S be a non-empty subset of the set of vertices V of size smaller 
than n and let S c = V \ S. We consider the matrix Q s = (qfj)i,jev that is derived from the 
contact rate matrix Q as follows 



We first establish that eigenvalues of the matrices Qs, for S C V non-empty, are strictly 
negative. This will be a key property that ensures finiteness of our bound which we present 
later in this section. 




(1) 
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Lemma 1 For every finite graph G, we define 5(Q,a) as follows 



5(Q,a) = min |A Qs | 

Scy,^e[2tt-l,a] 

> n L ' J 

= min | Aq s I - (2) 

ScV,\S\=(2a-l)n b 

where Xq s is the largest eigenvalue of Qs- Then, for all S non-empty, Xq s < and thence 
5(Q,a)>0. 

Note that identity @ is a direct consequence of the Cauchy interlacing theorem for 
principal submatrices of orthogonal matrices [22l Theorem 4.3.8, p. 185]. 

We next present our main result that establishes an upper bound on the expected con- 
vergence time that holds for arbitrary connected graphs. Before stating the result, notice 
that at the end of phase 1 none of the nodes are in state 1, (2a — l)n nodes are in state 0, 
and the remaining 2(1 — a)n nodes are in either state e$ or state e\. At the end of phase 2, 
there are exactly (2a — l)n nodes in state and 2(1 — a)n nodes in state e$. The following 
theorem establishes a general bound for the expected duration of each convergence phases in 
terms of the number of nodes n and the parameter 8(Q, a), which we introduced in Lemmajlj 



Theorem 1 Let T% be the first instant at which all the nodes in state 1 are depleted. Then, 

E(T!)<— L-(logn + l). 
6(Q,a) 

Furthermore, letting T 2 be the time for all the nodes in state e\ to be depleted, starting from 
an initial state with no nodes in state 1, we have 

E(T 2 )<— L_(logn + l). 
6\Q,a) 

In particular, if T is the first instant at which none of the nodes is in either state e\ or state 
\, then 

E(T)<— |_(logn + l). 
6(Q,a) 

It is worth noting that the above theorem holds for every positive integer n and not just 
asymptotically in n. 



The proof of the theorem is presented in Section 4^ and here we outline the main ideas. 
The proof proceeds by first separately considering the two convergence phases. For phase 1, 
we characterize the evolution over time of the probability that a node is in state 1, for every 
node % e V. This amounts to a "piecewise" linear dynamical system. Similarly, for phase 2, 
we characterize the evolution over time of the probability that a node is in state e\, for every 
given node i 6 V, and show that this also amounts to a "piecewise" linear dynamical system. 
The proof is then completed by using a spectral bound on the expected number of nodes in 
state 1, for phase 1, and in state ex, for phase 2, which is then used to establish the asserted 
results. 
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Tightness of the bounds The bound for the convergence time of the first phase asserted 
in Theorem [T] is tight in the sense that there exist graphs for which the asymptotically 
dominant terms of the expected convergence time and the corresponding bound are either 
equal (complete graph) or equal up to a constant factor (star network). The bound for the 
second phase is not tight as we prove an upper bound for it using the worst case for an initial 
configuration for the start of phase 2. 

In what follows we provide proofs for Lemma [T] and Theorem [TJ 



4.1 Proof of Lemma [T] 

Let S be a non-empty subset of V. First, for the trivial case S = V the matrix Qs is 
diagonal with diagonal elements (— Y2j E v Qij)iev which are all negative since the graph is 
connected (each node has at least one neighbour). Now let S such that \S\ < n. Note that 
every eigenvalue A and the associated eigenvector x of the matrix Q s satisfy the following 
equations 

\xi = -qiXi, for i G S , , 

Xxi = -q^i + Y,i & v %i x h for z 6 ^ 

where ft := J^ieV^h for every i G V. 

On the one hand, it is clear from the form of the matrices Qs, given by 0, that for every 
i G S, A = — qi is an eigenvalue of Qs- Since by assumption, the transition matrix Q induces 
a connected graph G, we have that for every i G V there exists a j G V such that q^ > 0. 
Hence, it follows that A < 0. 

On the other hand, if A ^ —qi for any % G S, it is clear from ^ that Xi = for every I G S. 
In fact, since Q is symmetric, the remaining eigenvalues of Qs are the eigenvalues of the 
symmetric matrix Ms = (wif,-)ijes c defined by 



-J2i e v%h i=jeS c 
Qi,j, i,jeS c ,j^i 



Let A be such an eigenvalue of Qs and let x be the corresponding eigenvector and, without loss 
of generality, assume that ||a?||l = Yli^y x ? = ^2ies c x "i = 1- Note that A = XaFx = x T Qx. 
Since Q is symmetric, we have 
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/ j Qi,j X i 
i&S c ,jeS i,j&S c 



Therefore, it is clear that A < with A = only if 

Yl + o Yl qi >i( Xi ~ x ^ 2 = °- 
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Let W C S c be such that Xi ^ 0, for i e W 7 , and Xi = 0, for z G 5 C \ VT. Since x is an 
eigenvector, then W is non empty. If A = 0, then 

iewjes iew,jes c \w i,jew 

The above implies that there are no edges between S and W, and that there are no edges 
between W and S c \ W, i.e. W is an isolated component, which is a contradiction since Q 
corresponds to a connected graph. Therefore, A < 0, which proves the lemma. 



4.2 Proof of Theorem [T] 

We first separately consider the two convergence phases and then complete with a step that 
applies to both phases. 



Phase 1: Depletion of nodes in state 1 We describe the dynamics of the first phase 
through the following indicators of node states. Let Zi(t) and Ai(t) be the indicators that 
node i is in state and 1 at time t, respectively. The indicator of being in either state eo 
or state e\ at time t is encoded by Aiit) = Zi(t) = 0. The system state evolves according 
to a continuous-time Markov process (Z(t), A(t)) t >o, where A{t) = (Ai(t))i e y and Z(t) = 
{Zi{t))i & v, with the transition rates given as follows 

{{Z — ej, A — tj) with rate q^jZiAj 
[Z — 6i + €j, A) with rate qijZi(l — Aj — Zj) 
(Z, A — 6i + Ej) with rate qijAi(l — Aj — Zj) 

where i,j £ V and ej is the n-dimensional vector whose elements are all equal to but the 
i-th element that is equal to 1. 

Since Q is a symmetric matrix, we have for every i e V and t > 0, 

jMMt)) = -J2^MMt)zM)-J2%^(Mm-Mt)-Zj(t))) 

jev 



or, equivalently, 

j t HA(t)) = -(^^)IE(A(0) + E^ E (^)( 1 -^)))- 

Kiev J jev 

Let us now consider the behaviour of the set So(t) of nodes in state 0, i.e. So(t) = {i e 
V : Zi{t) = 1}. From the above dynamics, we see that there are intervals during 
which the set So(t) does not change (the instants are stopping times of the Markov chain 
describing the evolution of the algorithm). Let C V be the set of nodes in state 0, for 
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t G [tk,t k+ i), and let S% = V\ S k , i.e. S (t) = S k and V \ S (t) = S%, for t G [t k ,t k+1 ). We 
then can write, for t G [t k ,t k+ i), 

|^W) = -few)^W«)) + {^* , ' auE ' (i4,( ' )) ' -^ (5) 

where is the expectation conditional on the event {So(t) = S k }. In a matrix form, this 
gives 

jM k (A{t)) = Q Sk TE k (A(t)), for t k < t < t k+1 , 
where Qs k is given by ([!]). 

Solving the above differential equation, we have 

E fc (A(t)) = e Q ^ ( *-' fc) lE fc (A(t fe )), for t k < t < t k+1 . 
Using the strong Markov property, it is not difficult to see that 

JE(A(t)) = IE [e A(i) A(0)] , for t > 0, 

where 

k-i 

Kt) = Q Sk {t-t k ) +J2Qsi(U+i ~ti), for t k <t < t k+1 . 

1=0 

Note that X(t) is a random matrix that depends on the stopping times t k . 



Phase 2: Depletion of nodes in state ei To describe the dynamics in the second phase, 
let Bi(t) be the indicator that a node i G V is in state e\ at time t. The notation Zi(t) has the 
same meaning as in phase 1, thus Zi(t) is the indicator that node i G V is in state at time 
t. The indicator that a node i G V is in state eo at time t is encoded by Bi(t) = Zi(t) = 0. 
The dynamics in this phase reduces to a continuous-time Markov process (Z(t), B(t)) t > , 
where Z{t) = (Zi{t)) ie y and B(t) = (_Bj(t))j e y, with the transition rates given as follows, for 
i,j G V, 

{(Z — e.j + €j, B — 6j) with rate qijZiBj 
[Z — ei + tj, B) with rate qi : jZi(l — Bj — Zj) 

(Z, B — 6i + tj) with rate qi : jBi(l — Bj — Zj). 

From this, we have for every i e V and t > 0, 
d 



dt 



Therefore, for every i G U and t > 




E(^(t)) = - (5>>0 + (^(0(1-^(0)) ■ 
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Similar to the first phase, we see that there are intervals [t' k ,t' k+1 ) during which the set So(t) 
does not change (the instants t' k are stopping times). Let S' k be such that So(t) = S' k , for 
t E [t' k ,t k+1 ). Similarly to the first phase, we have 

E(£(t)) = IE fe^B^o)] , for t > 0, 

where X'(t) is a random matrix given by 

fe-i 

= Qs' k (t - 4) + ^Qstft+i - t\), for t' k <t< t' k+1 . 
1=0 

Note that t' = T\ is the instant at which phase 2 starts (phase 1 ends). 

Duration of a phase In both phases, the process of interest is of the form 

E(Y(t)) = IE [e m Y(0)] , for t > 0, 

where for a (random) positive integer m > and a sequence = t < tx < ■ ■ ■ < t m , we have 

fc-i 

Hi) = Qs k {t - t k ) + S ^Q Sl {ti+i - U), for t k <t< t fc +i, k = 0, 1, . . . ,m- 1. 

For phase 1, F(t) = while for phase 2, = -B(t). Using techniques as in [23|i Chapter 
8], we have for or every t > 0, 

||iE(r(t))|| 2 <iE[|| e A Wr(o)|| 2 ] < E[|| e A W|| ||y(o)|| 2 ] 



< E 



fe-i 



sAt-t k )\ 



J J || e Qs,(ti+i-t/)|| ||y 



z=o 



< e- 5 ( Q ' a )*IE(||y(0)|| 2 ) < v^e- 5(Q - Q)i 

where 1 1 ■ 1 1 denotes the matrix norm associated to the Euclidean norm 1 1 • 1 1 2 . In the above, we 
used Jensen's inequality in the first inequality, followed by the property of matrix norms for 
the second and third inequalities, then Lemma [T] and finally the fact that Y is a n-dimensional 
vector with elements taking values in {0, 1}. 

Furthermore, combining with Cauchy-Schwartz's inequality, we have 

^2~E(Yi(t)) < \\E(Y(t))\\ 2 ||1|| 2 <ne 4(M , for every t > 0. 

iev 

where 1 = (1, . . . , 1) T . Therefore, we have 

JP(X(t) ^ 0) < E ( y i(*)) < n e~ S{Q ' a)t , for every t > 0. 

i&V 
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Let To be the time at which Y(t) hits = (0, . . . , 0) T , which corresponds to T\ for the process 
A(t) and T2 for the process B(t). Then, we have 



/»oo /»oo 

E(T ) = / P(T > t)dt = / P(y(t) 7^ 0)dt 



log(ra) 
<5(Q, a 

log(n) + 1 



d(Q, at) 7 i°g(") 



oo 



<5(<3,q) 



<f(Q,a) 

which completes the proof of the theorem. 



5 Application to Particular Graphs 

In this section we instantiate the bound of Theorem [I] for particular networks including 
complete graphs, paths, cycles, star-shaped networks and Erdos-Renyi random graphs. For 
all these cases, we compare with alternative computations and find that our bound is of 
exactly the same order as the expected convergence time with respect to the number of 
nodes. For the complete graph, we also examine the expected convergence time as the voting 
margin goes to zero. 



5.1 Complete Graph 

We consider the complete graph of n > 1 nodes where each edge e £ E is activated at in- 
stances of a Poisson process with rate l/(n — 1), i.e. we have q^j — l/(n— 1) for all i,j £ V 
such that i ^ j. 

Lemma 2 For the complete graph of n > 1 nodes and every fixed a £ (1/2, 1], we have 

5(Q,a) > (2a - 1). 



Proof: For the complete graph, the matrix Q$ is as follows 

r -1, t=j 

<& = { ^ ie&j^i 
{ 0, ieSj^i. 

First of all —1 is an eigenvalue of order \S\. In addition it is not difficult to see that the 
vector x such that X{ = for i £ S and xi = 1 for i £ 5* c is an eigenvector of matrix Qs 
with the eigenvalue — -^t. Since in each of the two convergence phases, the matrices Qs k are 

such that \Sk\ > (2a — l)n, we have -^4 > (2a — 1)-^. Finally note that the remaining 
eigenvalues are the eigenvalues of the matrix Ms = (nT>f j)i,jeS c defined by 

s _ ( -1, t = jeS c 
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One can rewrite Ms as Ms = —^zjl + ^Z\J > where / is the identity matrix and J is the 
matrix with all its entries equal to 1. Therefore the remaining eigenvalue of Qs is — with 
mutiplicity n — |S| — 1 and S(Q, a) > (2a — 1). I 
Combining the last lemma with Theorem [TJ we have the following corollary. 



Corollary 1 For the complete graph of n > 1 nodes, the expected duration of phase i = 1 
and 2 satisfies 

E(T0<2^(log(n) + l). 



In the following, we will show that the latter bound is asymptotically tight, for large number 
of nodes n, for convergence phase 1. 

Comparison with an alternative analysis For complete graphs, the convergence time 
can be studied by an analysis of the underlying stochastic system that we describe in the 
following. 

We first consider the convergence phase 1. Let = To < T\ < • • • < t\s x \ denote the time 
instances at which a node in state and a node in state 1 get in contact. Recall that |So| 
and | Si | denote the initial number of nodes in state and state 1, respectively. It is readily 
observed that |So(i)| = |So| — i and |Si(t)| = |Si| — i, for r» <t < r i+ \ and 1 < i < \S\\. 
It is not difficult to observe that r i+1 — r^, i — 0, 1, . . . , |Si| — 1, is a sequence of independent 
random variables such that for each < i < |Si|, r i+ i — T{ is a minimum of a sequence of 
(\Sq\ — i)(\Si\ — i) i.i.d. random variables with exponential distribution with mean n — 1. 
Therefore, the distribution of Tj+i — Tj is exponential with mean for < % < |Si|, where 
Hi — (|So| — i)(|Si| — i)/(n — 1). In particular, we have E(Ti) = ^i=o _1 fh 1 ' 1 - e - 

\Si\-l 

(Pol -i)(\Si\ -i) 
The above considerations result in the following proposition. 



Proposition 1 For every complete graph with n > 1 nodes and initial state such that \Sq\ > 
| Si | > 0, the duration of the first convergence phase is a random variable T\ with the following 
expected value 

Tii — 1 

E ( T l) = Tq~\ — fcT ( H \ s ^\ + H \So\-\Si\ - ^|5 |) (7) 
Pol — 

where Hk = Y2i=i \- Furthermore, for every fixed a E (1/2, 1], we have 

E(T 1 ) = ^ T log(n) + 0(l). 
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Figure 1: Complete graph: the expected duration of convergence phases vs. the number of 
nodes n. The initial majority state is held by \otn\ nodes, where a = 3/4. The solid line 
is the asymptote log(n)/(2a — 1) while the bars are simulation results with 95% confidence 
intervals. 



From the result of the proposition, we observe that the expected duration of the first 
phase is log(n) /5(Q, a), asymptotically for large n, where 5(Q, a) is given in Lemma |2j thus 
matching the upper bound of Theorem [TJ 

For the prevailing case of a complete graph, we can characterize the expected convergence 
time of the first phase as a approaches 1/2, i.e. as the voting margin 2a — 1 approaches 
from above. We first consider the limit case where initially there is an equal number of nodes 
in state and state 1, i.e. |So| = \S\\. From ([6]), it is straightforward to note 

E(T0 = yn(l + o(l)). 

Therefore, we observe that in case of an initial draw, i.e. equal number of state and state 1 
nodes, the expected duration of the first phase scales linearly with the network size n. Note 
that in this case, the second phase starts with nodes in state eo and state e\ and obviously 
no majority can follow. 

We now discuss the case where \S \ — I Si I is strictly positive but small. To this end, let 
fj, n denote the voting margin, i.e. /i n = (|5 | — \Si\)/n. From ([7]), is easy to observe that 

E(Ti) = — log(n//„) + 0(l). 

Therefore, we note that for the voting margin \x n = 0(l/n), IE(Ti) = 0(n) while for \i n > 
a fixed constant, we have IE(Ti) = 0(log(n)). For the intermediate values of the voting 
margin, say for fi n = l/n a , for < a < 1, we have E(Ti) = ^^ra a log(n). 

Finally, we compare the bound log(n)/(2a — 1) with simulation results, in Figure [T] We 
observe that the bound is tight for phase 1 and not tight for phase 2, due to our choice of 
the initial condition in phase 2. Note also that Figure [l] indicates that the expected duration 
of convergence phase 2 scales as 6 (log(logn)). 
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Figure 2: Path: the expected duration of convergence phases vs. the number of nodes n, for 
a = 3/4. The solid line is the asymptote in Corollary [2] while the bars are simulation results 
with 95% confidence intervals. 



5.2 Paths 

We consider a path of n > 1 nodes where each edge is activated at instances of a Poisson pro- 
cess of rate 1. Therefore, the contact rate matrix Q is given by <2v+i = 1, for i — 1, . . . , n — 1, 
q^i-i — 1, for i = 2, 3, . . . , n, and all other elements equal to 0. 



Lemma 3 For a path of n > 1 nodes, we have, for a G (1/2, 1], 



5(Q,a) = 2(l-cos 

'2 



7T 



7T 



16(1 - a) 2 n 2 



4(1 - a)n + 1 

1 + 0(1)). 



The proof is provided in Appendix |A.l| The previous lemma, together with Theorem [T], 
yields the following result. 



Corollary 2 For a path of n > 1 nodes and a G (1/2, 1), we have for phase i = 1 and 2, 

< 16(1 7 Q) V log(n) + 0(1). 



Finally, we compare the asymptotic bound with simulation results in Figure [2] The results 
indicate that the bound is rather tight for phase 1 and is not tight for phase 2. 
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Figure 3: Cycle: the expected durations of convergence phases vs. the number of the nodes 
n. The initial state is such that state is held by a set of \an\ consecutive nodes along the 
cycle with a = 3/4. 



5.3 Cycles 

We consider a cycle of n > 1 nodes where each edge is activated at instances of a Pois- 
son process with rate 1. Therefore, the contact rate matrix Q is given by q^i+i = 1, for 
i = 1, . . . , n — 1, qi t i-i = 1, for i = 2, 3, . . . , n, qi >n = q n> i = 1, and all other elements equal to 
0. 



Lemma 4 For a cycle network of n > 1 nodes, we have, for a G (1/2, 1], 

6{Q,a) = 2(1 — cos 



7T 



71' 



4(1 - a) 2 n 2 



2(1 -a)n + l 
;i + o(l)). 



The proof is provided in Appendix |A.2 



Corollary 3 For the cycle with a G (1/2,1), we have for phase i = 1 and 2, 



E(Ti) < 



4(1 - a) 



71" 



-n 2 log(n) + 0(1). 



Finally, we compare the last bound with simulation results in Figure [3j Similar as in 
other cases, we observe that the bound has the same scaling with the number of nodes as 
the expected duration of convergence phase 1, and is not tight for convergence phase 2. 
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5.4 Star-Shaped Networks 

We consider a star-shaped network that consists of a hub node and n — 1 leaf nodes. Without 
loss of generality, let the hub node be node 1 and let i = 2, 3, . . . , n be the leaf nodes. The 
contacts between a leaf node and the hub are assumed to occur at instances of a Poisson 
process of rate l/(n— 1). This setting is motivated in practice by networks where a designated 
node assumes the role of an information aggregator to which other nodes are connected and 
this aggregator node has access capacity of rate 1. The elements of matrix Q are given by 
Qi,i — Qi,i = l/(n — 1), for i = 2, 3, . . . , n and other elements equal to 0. 

We have the following lemma for the star-shaped network that we defined above. 



Lemma 5 For the star network of n > 1 nodes, we have 

S(Q,a) 



n 



> 



2(n-i; 
2a - 1 
n 



4(2a - 1) 



n 



where the inequality is tight for large n. 



The proof is provided in Appendix A. 3 The previous lemma yields the following corollary. 



Corollary 4 For the star network with n > 1 nodes and every fixed a G (1/2, 1], the expected 
duration of phase i = 1 and 2 satisfies 



2a -1 



n(log(n) + 1). 



Comparison with an alternative analysis for phase 1 For the star-shaped network of 
n nodes, we can compute the exact asymptotically dominant term of the expected duration 
of phase 1, for large n, which is presented in the following proposition. 

Proposition 2 For the star-shaped network of n nodes, the expected time to deplete nodes 
in state 1 satisfies 



E(Ti 



(2a- 1) (3 -2a) 



n log(n) + 0(n). 



(8) 



The proof is provided in Appendix [Bj 

Notice that the dominant term in Proposition [2]is smaller than the upper bound in Corollary|4] 
for the factor 1/(3 — 2a). 
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2<x-1 2<x-1 

Figure 4: Star-shaped network: expected duration of convergence phases versus the voting 
margin 2a — 1, for n = 1000. The solid curves indicate log(n)/(2a — 1); the dashed line 
indicates log(n)/[(2a — 1)(3 — 2a)]; the bars indicate 95%-confidence intervals of estimates 
obtained by simulations. 



Remark We only consider the expected convergence time for phase 1. Similar analysis 
could be pursued for phase 2 but is more complicated, because the lumping of the states as 
done in the proof for phase 1 cannot be made. 

Finally, we compare our bound with simulation results in Figure |4j The results indicate that 
the bound of Corollary [4] is not tight. We also observe that the asymptote in Proposition [2] 
conforms well with simulation results. 



Qi,j r ,\ i,j 



5.5 Erdos-Renyi Random Graphs 

We consider random graphs for which the matrix of contact rates Q is defined as follows. 
Given a parameter p n G (0, 1) that corresponds to the probability that a pair of nodes interact 
with a strictly positive rate, we define the contact rate of a pair of nodes i, j G V, i ^ j, as 
follows 

1 

(n - l)p, n 

where X^j is a sequence of i.i.d. random variables such that JP(Xij = 1) = 1 — P(Xij = 
0) = Pn, for every i,j G V, j ^ i. The rates are normalized with the factor l/(n — l)p n , so 
that for each node, the interaction rate with other nodes is 1. 

Furthermore, we assume that p n is chosen such that, for a constant c > 1, 

log(n) 

Pn = c 

n 

which ensures that the induced random graph is connected with high probability. 
We have the following lemma. 

Lemma 6 Suppose c > an d a ^ (1/2? 1]- We then have, with high probability that 

6(Q, a) > (2a - l)^ 1 ( - 2 - ) + O ( , (9) 



c{2a — 1) J \logn 
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Figure 5: Erdos-Renyi random graphs: the expected duration of convergence phases vs. the 
voting margin 2a — 1, for n = 1000 and c = 100. The solid curves indicate the bound of 



Corollary 10 the dashed lines indicate log(n)/(2a — 1); the bars indicate 95%-confidence 
estimates. 



where tp is the inverse function of ip(x) = x\og(x) + 1 — x, for x G [0, 1]. 
The proof is provided in Appendix A.4| 

From the last lemma and Theorem [TJ we have the following corollary. 

Corollary 5 Under c > 2~i an d en G (1/2, 1] ; we have for the duration of phase i = 1 and 
2, 

% < — ^ log(n) + 0(1) (10) 

with high probability. 



Remark We note the following intuitive observation: the asserted bound for the expected 
convergence time for each of the phases boils down to that of the complete graph, for large 
expected degree of a node, i.e. large c. Indeed, this holds because for every fixed a G (1/2, 1], 
the term V 9 ~ 1 ( e (2a-i) ) & oes to 1 as c grows large. 

Finally, we compare the bound of Corollary [5] with estimates obtained by simulations in 
Fig. [5] The results confirm that the bound is indeed a bound and that it is not tight, which 
is because of a bounding technique that we used in the proof. 



6 Conclusion 

We established an upper bound on the expected convergence time of the binary interval con- 
sensus that applies to arbitrary connected graphs. We showed that for a range of particular 
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graphs, the bound is of exactly the same order as the expected convergence time with respect 
to the network size. The bound provides insights into how the network topology and the 
voting margin affect the expected convergence time. In particular, we showed that there 
exist network graphs for which the expected convergence time becomes much larger when 
the voting margin approaches zero. The established bound provides a unifying approach to 
bound the expected convergence time of binary interval consensus on arbitrary finite and 
connected graphs. 

An important direction of future work is to consider lower bounds on the convergence time. 
In particular, it would be of interest to better understand how to fine tune the interaction 
parameters qij to achieve the best possible convergence time for a given connected graph 
under given memory and communication constraints. 
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A Characterization of 5 for Particular Graphs 



A.l Proof of Lemma [3] (Path) 

Recall that the matrix Q is the tridiagonal matrix with q^i+i = 1, for i = 1,2, . . . ,n — 1, 
q^i-i = 1, for i = 2, 3, . . . , n, and all other elements equal to 0. 

We will separately consider four cases depending on whether the respective end-node 1 
and n is in S or S c . In the following, we denote with £a(A), the characteristic polynomial of 
a matrix A. 



Case 1 : 1 G S and n G S* c . In this case, we repeatedly expand the matrix XI — Q$ along the 
rows i G S, using Laplace's formula, to obtain 



£ Qs (A) = (A + l)(A + 2)l s l^ B (A) 

where the matrix B is a block-diagonal matrix with blocks Bi, B 2 , . 
that are symmetric tridiagonal matrices of the form 



(11) 



B h , for 1 < b < \S\ 



( -2 + Cl 1 
1 -2 10 

1-2 1 























\ 



(12) 



1 -2 + c 2 / 



where Ci = c 2 = 0, for % — 1, 2, . . . , b — 1 (type 1), and Ci = and c 2 = 1, for i — b (type 2). 

Since B is a block-diagonal matrix, notice that £b(A) = E[i=iCBi(A)- Hence, together 
with (11), we have that Qs the largest eigenvalue of a block that is either of type 1 or type 
2. 

The eigenvalues of tridiagonal matrices of the form ( 12 ) are well known for some values 
of the parameters C\ and c 2 , see e.g. (23]. In particular, for a m x m tridiagonal matrix of 
type 1, i.e. for c\ = c 2 = 0, we have eigenvalues 



Ai 



-2 1 



cos 



7T 



k 



1, 2 m. 



+ 1 

For a m x m tridiagonal matrix of type 2, i.e. for c\ = and c 2 = 1, we have eigenvalues 

(2k - l)?r 



(13) 



-2 1 



cos 



2m + 1 



fc = 1, 2 m. 



It is readily checked that the largest eigenvalue is K\ with m = \S C \. This corresponds to 
the case where the nodes in the set S are 1, 2, . . . , \ S\ (i.e. form a cluster). 



Case 2: 1 G S c and n G S. In this case, we have that (11) holds but the blocks of matrix B 
redefined so that c\ — 1 and c 2 = 0, for B±, and c\ = c 2 = 0, for £? 2 , _B 3 , . . ., and Bf,. It is 
readily observed that the eigenvalues of the block matrix B\ are the same as for c\ = and 
c 2 = 1 (type 1 tridiagonal matrix in Case 1). Hence, the largest eigenvalue is same as under 
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Case 1. Notice that in this case, it corresponds to taking nodes n — \S\ + 1, n — \S\ + 2, . . ., 
and n to be in the set S. 



Case 3: 1 G S° and n G S°. In this case, by the same arguments as in Case 1, we have 

fc fl (A) = (A + 2)l%,(A) 

where B is a block-diagonal matrix with blocks Bi, B 2 , . . . , -Bfe, 1 < b < \S C \, which are of 



the form (12) such that C\ = 1 and c 2 = for B±, c\ = c 2 = 0, for B 2 , B 3 , . . ., and B b , and 



c\ = and c 2 = 1, for In this case, the largest eigenvalue is /ci with m — \S C \ — 1. 

Case ^ : 1 G 5 and n G S. In this case, we have 

WA) = (A + l) 2 (A + 2fl- 2 ^(A) 
where is a block-diagonal matrix with blocks B\, B 2 , . . . , Bb, 1 < b < \S C \ that are all of 



the form (12) with C\ = c 2 = 0. In this case, the largest eigenvalue is Ai with m = \S C \. 
Finally, we observe that for each of the four cases, since \S C \ < 2(1 — a)n, we can take S(Q, a) 
as asserted in the lemma, which completes the proof. 

A. 2 Proof of Lemma [4] (Cycle) 

The proof is similar to that for a path in Section |A.1[ We will see that for the cycle, we 



will deal with blocks of tridiagonal matrices of the form (12) with c\ = c 2 = 0. Recall that 
for the cycle, we have matrix Q such that q^i+x = 1, for i = 1, 2, . . . , n — 1, qi-ij = 1, for 
i — 2, 3, . . . , n, qi 7n = q n ^ = 1, and all other elements equal to 0. 
Again, we separately consider the following four cases. 

Case 1 : 1 G S and n G S c . By successive expansion along rows i G S, we obtain 

e Qs (A) = (A + 2)l^ B (A) (14) 

where B is a block-diagonal matrix with blocks of the form ( Jl2| ) with c\ = c 2 = (referred to 
as type 1). Since B is a block-diagonal matrix, we have £b(A) = E[i=iCsi(A)) 1 < b < \S C \, 

c I 



where Bi is a matrix of type 1. The largest eigenvalue is Ai, given in (|13j), for m = \S' 
Case 2: 1 G S c and nsS. In this case, the same arguments hold as in Case 1. 



Case 3: 1 G S c and n G S c . In this case, (14) holds, with matrix B with diagonal blocks and 
other elements as in Case 1, except that 6i,|s c | — b\s c \,i — 1- This matrix can be transformed 
into a block-diagonal matrix of the same form as in Case 1 by permuting the rows of the 
matrix, hence, it has the same spectral properties as the matrix B under Case 1. Specifically, 
this can be done by moving the block of rows that correspond to B\ to the bottom of the 
matrix B. 

Case 4 '■ 1 G S and n G S. In this case, the same arguments apply as in Case 1. 

Finally, we observe that for each of the four cases, since \S C \ < 2(1 — a), we can take 
S(Q, a) as asserted in the lemma, which completes the proof. 
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A. 3 Proof of Lemma [5] (Star) 

We separately consider the two cases for which either the hub is in the set S or not. 



Case 1 Suppose that the hub is in the set S, i.e. 1 G S. It is easy to observe that in this 
case, the matrix Qs is a triangular matrix with all upper diagonal elements equal to 0, and 
the diagonal elements equal to (—1, — -^r, • • • , — ~zi)- Hence, the largest eigenvalue is — — j-. 

Case 2 Suppose now that the hub is not in the set S, i.e. 1 G S c . If A is an eigenvalue of 
Qs with an eigenvector x, then we have 

Axi = -Xi H — - Xi 

n — 1 / — ' 

ies c \{i} 

Ax, = H — '— , for*GS c \{l} 

n — 1 n — 1 

Xxi = Xi, for i £ S. 

n — 1 



This implies 



Axi = -xi H 

n — 1 *— ' 

ies c \{i} 

acj = ((n - 1)A + l)x i; foriGS c \{l} 

Axj = Xj, for i & S. 

n — 1 

Suppose that x is such that Xj = 0, for every z G S. From the last above identities, it readily 
follows that A is a solution of the quadratic equation 

, 2 n , If \S c \-l\ 

A 2 + -A + i_L_L_^] =0 . 

n — 1 n — 1 \ n — 1 

It is straightforward to show that the two solutions are 



A 1^ l-Jl-M andA 1^ l + ,l- 4lSI 



2 n — 1 \ V n 2 / 2 n — 1 \ V ^ 2 

Clearly, the largest eigenvalue is Ai and since \S\ > {2a — l)n, it is maximized for = 
(2a - l)n. 

Finally, we note that the largest eigenvalue is attained in Case 2, which establishes the first 
equality in the lemma, from which the asserted inequality and its tightness readily follow. 

A. 4 Proof of Lemma [6] (Erdos-Renyi) 

From (|4]), note that for every S C V such that < \S\ < n, if A is an eigenvalue of matrix 
Qs, then 



A = - 

ieS c ,jeS 



s c ,j&s i,jes c % lies J 
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since Yli^s c x 1 ~ 1- m ^he following, we would like to find a value x n > such that 
min ig 5 C "fX^es ftj} > x n holds with high probability. To this end, we consider the proba- 
bility that the latter event does not hold, i.e., for x n > 0, we consider 

p e :=JP ^mmj^gyj < x r . 
We first show that the following bound holds p e < p e with 



% = 2(1 - a)nexp [ -(2a - l)(n - l)p n <p [ ) ) (15) 



X t 



where we define <p(x) = xlog(x) + 1 — x, for x > 0. 
To see this, note that for every fixed 9 > 0, 



Pe 



VieS c lies' J / Vies y 



|S c |e ten (l+p„ (e~c«=W-l 



|5| 



where the first inequality follows by the union bound, the second inequality by the Chernoff's 
inequality, and the third equality by the fact that is a Bernoulli random variable with 
mean p n . 

Since |5 C | < 2(1 — a)n and \S\ > (2a — l)n, we have 

at / 9 \\(2Q!-l)n 

p e < 2(1 - a)ne to ™ ( 1 + p n (e (— Dpn-ljj 
Furthermore, using the fact 1 + p n (e f™- 1 ^ — 1) < exp(p n (e (»- 1 )pn — it follows 

&r n +(2a-l)np n f e _ -l^ 

Pe < 2(1 — a)ne V / . 

It is straightforward to check that the right-hand side in the last inequality is minimized for 



9 = — (n — l)p„log (ofz?) an d fo r this value is equal to p e given in (15). 
Requiring p e < - is equivalent to 



x n \ ^ 21og(n)+log(2(l-a)) 



> 



2a — 1 J (2a — l)np v 



From this it follows that A < — x n holds with high probability provided that x n > can be 
chosen such that 

( x n \ . 2 / log(2(l-q)) ^ 
H2^lJ-^^T)l 1+ 21og(n) J' (16) 
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Such a value x n exists as <p{x) is a decreasing function on [0, 1], with boundary values <^(0) = 
and <f(l) = 0, and under our assumption, c(2a — 1) > 2, the right-hand side in (16) is smaller 
than 1 for large enough n. 



Finally, from (16), we note 



x n > (2a - l)(p 



-i 



c(2a - V 



O 



log(n) 



from which the asserted result follows. 



B Proof of Proposition 2 



Let H (t) denote the state of the hub at time t. Due to the symmetry of the considered graph, 
it is not difficult to observe that the dynamics is fully described by a continuous-time Markov 
process (H(t), \So(t)\, |Si(i)|, |S eo (t)|, \S ei (t)\) t >o- We need to compute the expected value of 
the smallest time t such that |Si(£)| = 0, i.e. the time when the state 1 becomes depleted. To 
this end, it suffices to consider (H(t), \So(t)\, |Si(£)|, |S e (£)|) where S e (t) = S eo (i) US ei (t), i.e. 
the system states eo and e\ are lumped into one state, which we denote with e. The system 
will be said to be in mode % at time t, whenever the number of depleted state 1 nodes before 
time t is equal to i, for % = 0, 1, . . . , |Si(0)| — 1. Notice that if the system is in mode % at time 
*, then |S (t)| = |5b(0)| - i, \Si(t)\ = |5i(0)| - i, and \S e (t)\ = \S e (0)\ + 2i. For simplicity, 
we will use the following notation x l = | So (0)1 — i, x\ = |Si(0)| — i, and x\ = |S e (0)| + 2i, for 
every given mode i. 

We will compute the expected sojourn time in each of the modes by analyzing a discrete- 
time Markov chain 0* = {<p l k )k>0i for given mode i, defined as follows. This Markov chain 
is embedded at time instances at which the hub node interacts with a leaf node. The state 
space of (jf consists of the states 0, 1, e, e* with the transition probabilities given in Figure [6} 
The state of the Markov chain (jf indicates the state of the hub at contact instances of the 
hub with the leaf nodes, where we introduced an extra state e* to encode the event where 
the hub is in state e and that this state was reached by the hub from either state or 1, thus 
indicating a depletion of state 1. Note that the expected duration of mode is equal to the 
mean hitting time of state e* for the Markov chain 0° started at either state or 1, while the 
expected duration of mode i, for < i < |Si(0)|, is equal to the mean hitting time of state 
e* for the Markov chain (jf started at state e. We compute these mean hitting times in the 
following. Fix an arbitrary mode i, and then let <p s (i) be the mean hitting time of state e* 
for the Markov chain <jf started at state s — 0, 1, and e. By the first-step analysis, we have 
that the latter mean hitting times are the solution of the following system of linear equations 





x l — 1 / 


) + 


+ 1 




= Afoii) 






</>i(0 






+ 1. 



(17) 
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Figure 6: Star-shaped network: the transition probabilities of 



From this, it is straightforward to derive 

<Po(i) = {n -I) 



(18) 
(19) 
(20) 

: cm, 

x\ = (1 — a)n, x° = 0, and thus, from (18) and (19), the expected sojourn in mode is <^o(0) 



n 



Zo^xC 77, — X l))( n + X e) 

x l x\(n — x\)(n + x l e ) 

9 i i 

a^a^n + X*) 



The expected sojourn time in each given mode is as follows. For mode % — 0, it holds x\ 



and <^i(0), for the initial state of the hub equal to and 1, respectively, where 

1 , , , n - 1 



Vo(0) 



and <^i(0) 



an 



(21) 



On the other hand, for < % < \Si(0)\, the expected sojourn time in mode i is f e (i), and 



from (20) and x l = \S (0)\ 



l^i (0)1 



\s f 



+ 2i, we have 



in 



n 



(an — i)((l — a)n — i)(n + 2i) n + 2i 



(l-a)n-l 



<PeW 



(22) 



where s denotes 



Finally, the expected duration of phase 1 is equal to <p s (0) + Yli=i 
the initial state of the hub, either or 1. On the one hand, from (21), we have that for 



every fixed a G (1/2, 1], both <fo(0) and <fi(0) are asymptotically constants, as the number 



of nodes n grows large, thus y?o(0) = 0(1) and y?i(0) = 0(1). On the other hand, using (22) 
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and some elementary calculus, we obtain 

(1 ~ Q?)n ~ 1 - 1 - 1 

E Ve(i) = _ n i)(3 _ 2a) - (2a + 2a) " #(2a-l)n] 

/ 2 2 _ \ (1 -^T 1 n-l 

+ V(2a- l)(3-2a) (2a - 1)(1 + 2a) / ^ n + 2i' 

where, recall, H k = Yli=i \- From this, it can be observed that 



(l-a)ra-l 

E ^(»)= ^_ 1V3 _ 9 -, ntog(n) + Q(n). (23) 



(2a-l)(3-2a) 
This completes the proof of the proposition. 
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